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HIERARCHICAL DATA EXTRACTION 

This invention relates to the extraction of 
hierarchical data from a database and, in particular, to 
5 the automatic generation of structured query language (SQL) 
code to achieve this. 

Data are extracted from databases typically in one of 
two ways. The first way involves constructing a query, 
such as an SQL statement, to present to the database for 
10 execution- The query identifies to the database which data 
are required for extraction. 

The second method involves using a query tool such as 
Oracle Discoverer which, in simple terms, represents a 
graphical user interface between the user and the database. 
15 The query tool receives user input in a simplified format 
and generates the query, for example in SQL, to present to 
.the database and then presents the retrieved data to the 
user . 

A problem exists with both these methods when they are 
2 0 used to extract hierarchical data from a database since it 
is extremely complicated for a user to generate the 
required SQL code to extract the data and query tools are 
unable to generate the special SQL statements required. 

In accordance with one aspect of the present invention 
25 there is provided a computer implemented method for 
extracting hierarchical data from one or more tables in a 
database, the method comprising: 

a. receiving user input identifying, at least, one 
or more tables in the database that contain the 

30 hierarchical data and the hierarchical structure 

of the hierarchical data; 

b. generating a Structured Query Language (SQL) 
statement using the user input received in step 
(a) ; 

35 c. presenting the SQL statement to the database for 

execution; and. 
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d. extracting the hierarchical data from the 
database in response to the execution of the SQL 
statement . 

Hence, the invention provides a method for 
5 automatically generating the complex SQL required to 
extract hierarchical data from a database. The complexity 
involved is hidden from the user who in fact requires no 
knowledge of SQL in order to use the invention. 

Typically, the user input identifying the hierarchical 
10 structure indicates at least a column of one of the 
database tables that represents the parent items of the 
hierarchical structure and a column of one of the database 
tables that represents the child items of the hierarchical 
structure. 

15 In this case, the data value that is the top of the 

hierarchical structure may have a default value of NULL. 
Alternatively, the user input identifying the hierarchical 
structure may indicate the data value in the column 
representing the parent items that is the top of the 

2 0 hierarchical structure. 

The user input identifying the hierarchical structure 
may also indicate at least one further column of one of the 
tables that further restricts the hierarchy. 

The method may also be used to perform aggregation of 
25 values associated with the hierarchy. As such, prior to 
generating the SQL statement, further user input may be 
received, the user input indicating at least one column of 
one of the database tables on which leaf sum aggregation 
should be performed, 
30 Alternatively, or in addition, prior to generating the 

SQL statement, further user input may be received, the user 
input indicating at least one column of one of the database 
tables on which tree sum aggregation should be performed. 

In a preferred embodiment, the SQL statement is 

3 5 generated from the user input by: 
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i. constructing object SQL that will return 
all the database items identified by the 
user input ; 

ii- constructing object SQL that cal'culates the 
5 level of ah item in the hierarchical 

structure ; 

iii. constructing object SQL that calculates the 
number of child items appendant to an item 
in the hierarchical structure; and, 
10 iv, combining the object SQL of steps (i) to 

(iii) in a single object SQL statement. 
When the SQL statement is generated from the user 
input in this way and leaf sum aggregation is to be 
performed, further object SQL that performs the leaf sum 
15 aggregation may be constructed and this further object SQL 
may be combined with the object SQL of steps (i) to (iii) 
in step (iv) . 

Alternatively, or in addition, when the SQL statement 
is generated as described and tree sum aggregation is to be 

2 0 performed, further object SQL that performs the tree sum 

aggregation is constructed and this further object SQL is 
combined with the object SQL of steps (i) to (iii) in step 
(iv) . 

The invention will typically be provided as a computer 
25 program comprising computer program code means adapted to 
perform the steps of the first aspect of the invention when 
said program is run on a computer . 

Further, there may be provided a computer program 
product comprising program code means stored on a computer 

3 0 readable medium for performing a method according to the 

first aspect of the invention when said program product is 
run on a computer. 

Some embodiments of the invention will now be 
described with reference to the accompanying drawings, in 
35 which: 

Figure 1 shows a table in a database defining a pig's 
ear hierarchy; 
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Figure 2 shows the hierarchical structure of the 
hierarchy defined by the table of Figure 1; 

Figure 3 shows a pair of database tables defining a 
simple matrix hierarchy ; 
5 Figure 4 shows the contents of one of the tables 

defining the hierarchy of Figure 3; 

Figure 5 indicates the structure according to the 
hierarchy of .Figures 3 and 4; 

Figure 6 shows a set of tables defining a complex 
10 matrix value hierarchy; 

Figure 7 shows the contents of the ORG HIER table _of 
Figure 5; and. 

Figure 8 indicates the structure of the hierarchy 
according to Figures 6 and 7 . 
15 In order to describe the embodiments of the invention 

fully, it is worthwhile first describing two types of 
hierarchical structure with which the invention may be 
used. It is important to realise however, that the 
invention is not restricted for use with only these types 

2 0 of hierarchical structure and a skilled person will realise 

that its use can be extended to other structures. 

Whilst the embodiments below are presented with 
reference to "tables", it will be understood that these can 
also be thought of as database "views". 
25 Figure 1 shows a table designated EMP having eight 

columns under which data relating to the employees of a 
company are listed. Of particular interest are the columns 
EMPNO, ENAME and MGR. The values in the ENAME column are 
the names of the individual employees whilst the EMPNO and 

3 0 MGR columns gives their employee numbers and the employee 

numbers of their manager respectively. 

Each row in the table has a unique value of EMPNO and 
a corresponding value of MGR. This value of MGR refers to 
another value of EMPNO since the EMPNO and MGR columns are 
3 5 joined. For example, the value of EMPNO for the employee 
named SMITH is 7369 and the corresponding MGR value is 
7902. This MGR value corresponds to EMPNO value of 7902 
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which is that of the employee named FORD. In essence, this 
table defines the hierarchical structure of the company. 
Such a hierarchy is known as a pig's ear hierarchy and is 
characterised by a join from one column of a table to a 
5 different column of the same table. The main disadvantage 
of this kind of hierarchical structure is that the 
structure itself is mixed in with other data and so, 
supporting more than one hierarchy requires duplication of 
data . 

10 The resultant structure described by this table is 

shown in Figure 2 in which it can be seen that the employee 
with an ENAME value of KING is the top of the hierarchy 
(with an MGR value of NULL) . The employees with ENAME 
values of JONES, BLAKE and CLARK all have corresponding MGR 

15 values of 783 9 and this corresponds to the EMPNO value of 
KING and so these three employees form the second level of 
the hierarchy as shown in Figure 2. Subsequent levels of 
the hierarchy are built up in the same manner. 

The second type of hierarchical structure that will be 

2 0 described is a matrix hierarchy. This overcomes the 

disadvantage of pig's ear hierarchies since the hierarchy 
data is separated from the other data in its own hierarchy 
table. This allows for an unlimited number of hierarchical 
structures to be created on the same data therefore 
25 enabling, amongst other things, version control of 
hierarchies and complex data modelling. 

An example of a matrix hierarchy is shown in Figure 3 
which shows two tables named EMP and ORG. Only the columns 
that exist in each table are shown in Figure 3. The 

3 0 associated data are not shown. The table EMP is analogous 

to that shown in Figure 1 whilst the table ORG also has the 
columns EMPNO and MGR but has a second column known as 
ORGNAME. In this example, the EMPNO column of table EMP is 
joined to the corresponding EMPNO column of table ORG which 
35 has corresponding values. The corresponding values of MGR 
in table ORG are then joined back to the EMPNO column of 
table EMP. The third column ORGNAME stores the name of the 
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organisation and therefore allows for multiple hierarchies 
to be created, for example where the same employees may be 
working on different projects simultaneously then the 
ORGNAME column could refer to these different projects. An 
example of data that may be stored in this table ORG is 
shown in Figure 4 which shows two different organisations 
listed under the ORGNAME column with the names ORG A and 
ORG X. The values of EMPNO and the corresponding MGR 
values are listed next to these in order to define the 
hierarchies. The hierarchical structure according to this 
matrix example is shown in Figure 5. 

A matrix hierarchy may be used for more complicated 
hierarchical structures than simple parent -child 
relationships as previously described. For example. Figure 
15 6 shows a set of tables defining complex matrix hierarchy. 
In this example, the ORG 'hIER table is similar to the ORG 
table of Figure 3 but has three further columns ORGHIER_ID, 
VER_ID and BUS_GRP_NO. The ORGHIER_ID column is a unique 
key for each row of this table. The VER_ID column is 
20 joined to the corresponding column in a table known as HIER 
VERSION which has further columns known as VERSION NUMBER 
and STRUCT_ID. This latter column is further joined to a 
corresponding column in a table known as HIER STRUCTURE 
which has a second column known as STRUCTURE NAME. The 
25 BUS_GRP_NO column of table ORG is joined to a corresponding 
column in a table BUSINESS GROUP. As such, hierarchies 
having multiple version numbers within multiple 
hierarchical structures and all within multiple business 
groups can be described. Further details of the ORG HIER 
3 0 table are provided in Figure 7 whereas the overall 
hierarchy according to this example is shown in Figure 8. 

This invention is concerned with the extraction of 
hierarchical data from tables in a database that contain 
.such data. The data required may simply be the level in 
35 the hierarchy which an item occupies or the number of 
subordinates that each item in the hierarchy has. Further, 
the invention can also perform aggregation functions. 
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In particular, the invention can calculate the tree 
some and leaf sum aggregates for a hierarchical structure. 
These aggregates are explained below by example. 

Referring to Figure 2, the leaf sum and tree sum SAL 
5 aggregates for the employee called SMITH are 800. They 
equal the value of SAL for SMITH since he has no 
subordinates. SMITH reports to FORD for whom the leaf sum 
aggregate is also 8 00 being equal to the sum of the values 
in the SAL column for each of FORD's subordinates, in this 
10 case SMITH. The value of SAL for FORD is 3 000 and 
therefore the tree sum aggregate of SAL for FORD is 3800. 

FORD reports to JONES for whom the leaf sum aggregate 
of SAL is 1900 being equal to the sum of the values in the 
SAL column for each of JONES' immediate subordinates, FORD 
15 and SCOTT, only. The values in the SAL column for SMITH 
and ADAMS are relevant to the leaf sum aggregate of JONES. 
The tree sum aggregate for JONES is the sum of the values 
of the SAL column for JONES and all his subordinates. 

In order to generate an SQL statement for extracting 

2 0 the data required, a user is first required to input 

various items to indicate, amongst other things, the table 
that holds the hierarchy data and the columns in the table 
that hold the parent and child identifiers, thereby 
identifying the hierarchical structure. The data may be 

25 input in various ways. For example, it may be input 
through a graphical user interface or alternatively, it may 
form the rows of a database table which are referred to by 
software performing the invention. 

The entry of the user input and the subsequent 

30 construction of the SQL object for retrieving the 
hierarchical data will be described with reference to two 
examples. The first example. Example 1, is based on the 
simple pig's ear hierarchy of Figures 1 and 2 and the 
second example, Example 2, is based on the complex matrix 

3 5 hierarchy shown in Figure 6. 

The first three user input items that must be received 
by the software for performing the invention are mandatory. 



m 



8 

These items are respectively the name of the table that 
holds the hierarchy data (referred to as User Input 1) , the 
name of a column on the table that holds the parent items 
(referred to as User Input 2) and a column on the table 
5 that holds the child items (referred to as User Input 3) . 
If only these data items are input by the user then the 
software according to the invention will merely extract the 
level and number of subordinates for each parent item in 
the table. 

10 The remaining items of user input are all optional. 

The first optional item is a start value that identifies 
the data value in the parent items that indicates the top 
of the hierarchy (referred to as User Input 4) . This has 
a default value of NULL if no user assigned value is 

15 provided. The next user input item (referred to as User 
Input 5) is used to identify another column in the table 
that further restrict the hierarchy. If none exists then 
this item is not provided. Alternatively, the user can 
enter multiple columns for this user input item. 

2 0 The next optional user input item (referred to as User 

Input 6) identifies the columns of the table or view that 
contain numerical values for which the user wishes to 
perform tree sum or leaf sum aggregation. If there are 
none then this item is not provided. Alternatively, the 

25 user may enter multiple columns for this item. 

The final item (referred to as User Input 7) 
identifies the column that uniquely identifies the rows of 
the table that contains the child and parent items. For 
example, this may be the primary key. If there is no 

30 column on this table that does this then. this item is not 
provided. In this case, the unique identifier for the row 
will be formed from a composite key consisting of the child 
item and the items of User Input referred to by User Input 
5. 

3 5 The table below shows for the purposes of Example 1 

and Example 2 the values that are input for each of these 
user input items with respect to both. examples : 
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User Input Item 


Example 1 


Example 2 


User Input 1 


EMP 


ORG_HIER 


User Input 2 


MGR 


MGR 


User Input 3 


EMPNO 


EMPNO 


User Input 4 




0 


User Input 5 




VER_ID , BUS_GRP_NO 


User Input 6 


SAL, COMM 




User Input 7 


EMPNO 


ORGHIER^ID 



10 The construction of the SQL object for extracting the 

level in the hierarchy for each item, the number of 
subordinates that it has and tree and leaf sum aggregation, 
if appropriate, can be broken down into the generation of 
the following distinct SQL segments . 

15 Firstly, an initial SQL statement is constructed that 

selects the database items identified from the user input. 
This SQL object is then used in the creation of the other 
SQL segments. 

The first SQL segment that is constructed is that 
20 which performs the aggregation of the user identified 
numerical data for leaf and tree sum aggregation. 

The next segment is object SQL that calculates the 
levels for each item in the data hierarchy. 

The next segment is object SQL that calculates and 
25 extracts the number of subordinates for each item in the 
hierarchy. 

Object SQL is then generated that combines all of the 
above SQL segments into a single statement for execution on 
the database in order to extract the results required. 

3 0 The combined SQL statement may be turned into a SQL 

object in the form of a database view. This is described 
later with reference to Example 2 . 

In order to construct the initial SQL statement 
referred to above, the users input (as shown in the above 

35 table) is taken and a basic SQL object is built from it. 
If no values exist for User Inputs 5 or 6 then they are 
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excluded from this SQL object. If more than one column has 
been provided for User "inputs 5 or 6 then they are all 
included in a comma separated format. If the value 
specified for User Input 7 has already been specified in 
5 any of the other user input fields then User Input 7 is not 
included in the construction of the SQL object. For 
instance, in example 1 the value of User Input 7 is the 
same as the value for User Input 3. Once the SQL object 
has been constructed it can be validated against the 
10 database. This object is used in the generation of all the 
other SQL segments. 

The following SQL statement is used to construct the 
basic SQL object: 

15 (SELECT ' I I User_Input2 | | ' , ' | | User_input3 ] | ' , 

' I |User_input5 | | ' , ' \ |User_input6 | | » , ' | lUser_input7 | | • 
FROM • I I User__inputl | | ' ) 

This provides the following SQL statements for Example 1: 

20 

(SELECT MGR, EMPNO, SAL, COMM FROM EMP) 
and for Example 2 : . 

25 (SELECT MGR, EMPNO, VERBID, BUS_GRP_NO, ORG__NAME, 

ORGHIER_ID FROM ORG__HIER) 

The first SQL segment to be generated is that for 

calculating tree sum and leaf sum aggregate values. These 
30 are later incorporated with the SQL segments that perform 

the hierarchy level and number of subordinates calculations 

and placed in a single SQL object. 

This requires the SQL code generated to perform 

multiple passes (queries) on the same data within a single 
35 query operation. This is achieved by separating the SQL 

segments that calculate the tree sum and leaf sum 

aggregates from the SQL segments that calulate the 
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hierarchy level and number of subordinates. The tree and 
leaf sum aggregation SQL segments are generated separately, 
in this case as SQL objects held in a PLSQL function. 
However, the skilled person will realise that other methods 
5 may be used. 

These functions are first generated separately then 
called by the SQL segment that calculates the hierarchy 
level. Therefore, during the execution of the hierarchy 
level SQL segment, for each row returned, the tree sum and 
10 leaf sum aggregation SQL segments are executed. This 
allows for multiple passes/queries to be performed on the 
same data. 

The SQL required to calculate the tree sum is 
different from that required to calculate the leaf sum. 

15 Other forms of value hierarchy aggregation can also be 
implemented using this methodology. 

Firstly, the generation of a SQL segment for 
calculating the tree sum aggregate will be described. If 
no values exist for User Input 6 then no tree sum SQL 

2 0 segment is generated. If there are values for User Input 
6 then a dynamic SQL statement is created for each value 
entered in User Input 6. This dynamic SQL accepts a 
parameter value ' Parameter_Input ' which refers to the item 
for which tree sum aggregation is to be performed. This 

2 5 parameter value is populated, when the function is called 
from the dynamically generated SQL segment that performs 
the calculation of hierarchy level described later. 

The following SQL statement constructs the SQL segment 
for performing the tree sum aggregation: 

30 

'SELECT SUM( ' | | User_input_6 | | • ) FROM ('11 <INSERT BASIC SQL 
OBJECT GENERATED ABOVE> [ | * ) CONNECT BY PRIOR 
• I |User__input_3 | | ' = 'I | User_input_2 | | • START WITH 
' I I User__input_3 | | ' = ' | | Parameter_Input | | ' ; ' 

35 

In the case of Example 1, since User Input 6 has two 
values (SAL and COMM) , executing this SQL statement 
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generates two SQL segments, one for the SAL column and one 
. for the COMM column. These are: 

SELECT SUM (SAL), FROM (SELECT MGR, EMPNO, SAL, COMM FROM 
5 EMP) CONNECT BY PRIOR EMPNO = MGR START WITH EMPNO = 
Parameter__Input ; 

and 

10 SELECT SUM (COMM), FROM (SELECT MGR, EMPNO, SAL, COMM FROM 
EMP) CONNECT BY PRIOR EMPNO = MGR START WITH EMPNO = 
Parameter__Input ; 

User Input 6 is not provided for Example 2 and so no 
15 SQL is generated for calculating leaf sum aggregates. 

The SQL segment generated above is wrapped in a 
function that is called for each row returned by the SQL 
that calculates the hierarchy level . The wrapper used in 
this case is a PLSQL function but other types of wrappers 

2 0 could be used. The PLSQL function wrapper is dynamically 

generated using the following SQL statement: 

CREATE OR REPLACE FUNCTION tree_' | | User_input_6 | | » 
(Parameter_Input IN varchar2) RETURN NUMBER IS 
25 tree_aggregate number (10); CURSOR cl ( CHILD IN var char 2). IS 
'||<INSERT ABOVE SQL SEGMENT FOR PERFORMING TREE SUM 
AGGREGATION> I | ' BEGIN OPEN cl (Parameter_Input ) ; FETCH cl 
INTO tree_aggregate; CLOSE cl; RETURN (tree_aggregate) ; END 
TREE_' I I User_input_6 I | ' ; 

30 

The function is then dynamically created against the 
database. In the case of Example 1, this generates two SQL 
segments, one for SAL and one for COMM: 

3 5 CREATE OR REPLACE FUNCTION tree_SAL ( Parameter_Input IN 

varchar2) RETURN NUMBER IS tree_aggregate number (10); 
CURSOR cl (CHILDIN varchar2 ) IS SELECT SUM (SAL), FROM 



m ■ ■ 

■ G- 

13 

(SELECT MGR, EMPNO, SAL, COMM FROM EMP) CONNECT BY PRIOR 
EMPNO = MGR START WITH EMPNO = Paramet er_Input ; BEGIN OPEN 
cl (Parameter_Input) ; FETCH cl INTO tree_aggregate; CLOSE 
cl; RETURN (tree_aggregate) ; END tree_SAL; 

5 

and 

CREATE OR REPLACE FUNCTION tree_COMM (Parameter_Input IN 
varchar2) RETURN NUMBER IS tree__aggregate number ( 10 ); 
10 CURSOR cl (childin varchar2) IS SELECT SUM (COMM), FROM 
(SELECT MGR, EMPNO, SAL, COMM FROM EMP) CONNECT BY PRIOR 
EMPNO = MGR START WITH EMPNO = Parameter_Input ; BEGIN OPEN 
cl (Parameter_Input) ; FETCH cl INTO tree_aggregate ; CLOSE 
cl; RETURN (tree__aggregate) ; END tree_COMM; 

15 

User Input 6 is not provided for Example 2 and so no 
SQL is generated for calculating leaf sum aggregates. 

The generation of the SQL segment for performing leaf 
sum aggregation is implemented in a similar way to that for 

2 0 performing tree sum aggregation although it is more 

complex. Again, if no values are entered for User Input 6 
then no leaf sum aggregation function is generated. If 
there are values f or ^ 'User Input 6' then a dynamic SQL 
statement is created for each value entered in User Input 
25 6- This dynamic SQL statement accepts a parameter value 
' Parameter_Input ' This parameter value is populated when 
the function is called from the dynamically generated SQL 
statement for calculating the hierarchy level, as described 
later, 

3 0 The following SQL statement constructs the SQL segment 

for performing the tree sum aggregation: 

SELECT SUM( ' | | User_input__6 | | ' ) FROM ( ' | | <INSERT BASIC SQL 
OBJECT GENERATED ABOVE > | | ' ) A WHERE ' \ | User_input_3 | | ' IN 
3 5 (select B. • | | User_input_3 | | ' FROM ( ' | | <INSERT BASIC SQL 
OBJECT GENERATED ABOVE> | | ' ) B WHERE NOT EXISTS (SELECT 
C. ' I I User_input_6 | | • FROM ( ' | | <INSERT BASIC SQL OBJECT 
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GENERATED ABOVE> | | ■ ) C WHERE C . ' | | User_input_2 | | ' = 
B . ' I I User_input_3 | | ' ) CONNECT BY PRIOR B . ' 1 1 

User_input_3 | | ' = ^ , • | I User_input_2 | 1 • START WITH 

B. ' I I User_input_3 | | ' = Parameter_Input ' ) ; 

5 

In the case of Example 1, since User Input 6 has two 
values (SAL and COMM) , this SQL statement generates two SQL 
segments, one for SAL and one for COMM. These are: 

10 SELECT SUM(SAL) FROM (SELECT MGR, EMPNO, SAL, COMM FROM 
EMP) A WHERE EMPNO IN (SELECT B. EMPNO FROM (SELECT MGR, 
EMPNO, SAL, COMM FROM EMP) B WHERE NOT EXISTS (SELECT 

C. SAL FROM (SELECT MGR, EMPNO, SAL, COMM FROM EMP) C 
WHERE C.MC3R = B. EMPNO) CONNECT BY PRIOR B. EMPNO = B.MGR 

15 START WITH B. EMPNO = Parameter_Input ) ; 

and 

SELECT SUM (COMM) FROM (SELECT MGR, EMPNO, SAL, COMM FROM 
20 EMP) A WHERE EMPNO IN (SELECT B. EMPNO FROM (SELECT MGR, 
EMPNO, SAL, COMM FROM EMP) B WHERE NOT EXISTS (SELECT 
C.COMM FROM (SELECT MGR, EMPNO, SAL, COMM FROM EMP) C 
WHERE C.MGR = B. EMPNO) CONNECT BY PRIOR B. EMPNO = B.MGR 
START WITH B . EMPNO = Parameter_Input ) ; 

25 

User Input 6 is not provided for Example 2 and so no 
SQL is generated for calculating tree sum aggregates. 

The SQL segments generated are wrapped in a function 
that is called for each row returned by the SQL segment 
3 0 that calculates the hierarchy level. The wrapper used in 
this case is a PLSQL function but other types of wrappers 
could be used. The PLSQL function wrapper is dynamically 
generated using the following SQL statement: 

35 CREATE OR REPLACE FUNCTION leaf_' 

I |User_input_6| I ■ (Parameter_Input IN varchar2) RETURN 
NUMBER IS leaf_aggregate number (10) ; CURSOR cl (CHILDIN 



varchar2) is ' | | < INSERT ABOVE SQL SEGMENT FOR PERFORMING 
LEAF SUM AGGREGATION> I | ' BEGIN OPEN cl (Parameter_Input ) ; 
FETCH cl INTO leaf _aggregate ; CLOSE cl; 

RETURN (leaf _aggregate) ; END leaf_' | | User_input_6 | | ' ; 

The function is then dynamically created against the 
database. In the case of Example 1, this generates two SQL 
segments, one for SAL and one for COMM: 

CREATE OR REPLACE FUNCTION leaf_SAL (Parameter_Input IN 
varchar2) RETURN NUMBER IS leaf _aggregate NUMBER (10) ; 
CURSOR cl (childin varchar2) IS SELECT SUM (SAL) FROM 
(SELECT MGR, EMPNO, SAL, COMM FROM EMP) A WHERE EMPNO IN 
(SELECT B. EMPNO FROM (SELECT MGR, EMPNO, SAL, COMM FROM 
EMP) B WHERE NOT EXISTS (SELECT C.SAL FROM (SELECT MGR, 
EMPNO, SAL, COMM FROM EMP) C WHERE C.MGR = B. EMPNO) CONNECT 
BY PRIOR B. EMPNO = B.MGR START WITH B . EMPNO 
PARAMETER_INPUT) ; BEGIN OPEN cl ( Paramet er_Input ) ; FETCH 
cl INTO leaf_aggregate; CLOSE cl; RETURN (leaf _aggregate) ; 
END leaf_SAL; 

CREATE OR REPLACE FUNCTION leaf_COMM (Parameter_Input IN 
varchar2) RETURN NUMBER IS leaf_aggregate NUMBER (10) ; 
CURSOR cl (childin varchar2) IS SELECT SUM (COMM) FROM 
(SELECT MGR, EMPNO, SAL, COMM FROM EMP) A WHERE EMPNO IN 
(SELECT B. EMPNO FROM (SELECT MGR, EMPNO, SAL, COMM FROM 
EMP) B WHERE NOT EXISTS (SELECT C . COMM FROM (SELECT MGR, 
EMPNO, SAL, COMM FROM EMP) C WHERE C.MGR = B. EMPNO) CONNECT 
BY PRIOR B. EMPNO = B.MGR START WITH B . EMPNO 
PARAMETER_INPUT) ; BEGIN OPEN Cl (Parameter_Input ) ; FETCH 
cl INTO leaf_aggregate; CLOSE cl; RETURN (leaf _aggregate) ; 
END leaf_COMM; 

User Input 6 is not provided for Example 2 and so no 
SQL is generated for calculating tree sum aggregates . 

Once the PLSQL functions have been generated they are 
created against the database . 



The construction of the SQL for calculating the 
hierarchy level incorporating the tree and leaf sum 
aggregate functions consists of generating two SQL select 
statements. The first is an SQL statement for calculating 
the hierarchy level and calling the aggregate functions. 
The second is an SQL statement for calculating the 
hierarchy level combined with the aggregate values. 

These SQL statements are then combined using inline 
views to create a single SQL statement that calculates the 
hierarchy level and the tree and leaf sum aggregate values. 
If no values are provided for User Input 6 then the SQL 
statements only calculate the hierarchy level. 

The first SQL statement is used to calculate the 
hierarchy level and call the dynamically created functions 
that calculate the tree and leaf sum aggregates. 

The number of aggregate functions created will depend 
on the number of values entered by the user for User Input 
6. The number of aggregation functions affects the dynamic 
creation of the SQL SELECT clause. The higher the number 
of aggregate items, the longer the SELECT clause will be. 
■There is a repeating pattern to the creation of this SELECT 
clause. Each aggregate item creates two functions, one to 
find the leaf sum aggregate and one to find the tree sum 
aggregate . 

The following string, known as the aggregate function 
string, is generated for each aggregate item entered into 
User Input 6 : 

tree_' | 1 User_input_6 | | ' ( ' | | User_input_3 | | ' ) ' AS 

I I User_input_6 1 | •_treesum" , leaf_' | | User_input_6 | | ' 

( - I I User_input_3 | ] ' ) ' AS " ' | | User_input_6 1 1 •_leaf sum" 

In the case of Example 1, this string is generated for 
each value of User Input 6, SAL and COMM, and the resultant 
two strings are concatenated: 



m 

17 

tree_SAL(EMPNO) AS " SAL_treesum" , leaf _SAL (EMPNO) AS 
" SAL_leaf sum" , tree__COMM (EMPNO) AS "COMM_treesum" , 
leaf_COMM(EMPNO) AS "COMM_leaf sum" , 

5 User Input 6 is not provided for Example 2 and so the 

strings are not generated. 

The following string, known as the key items string, 
is generated for each value entered for User Input 5: . 

10 I |User_input_5 I | ' AS " ' | | User_input_5 | | ' " 

-If no values are entered for User Input 5 then this 
string is not created. Since, User Input 5 is not provided 
for Example 1 and so the strings are not generated. 
15 With respect to Example 2, two values are provided for 

User Input 5 and so these two values are included in the 
string: 

BUS__GRP__NO AS "BUS_GRP_NO" , VERBID AS "VER_ID", 

20 

If a user has identified and input a primary key for 
User Input 7 then a string, known as the primary key 
string, is created as, follows: 

25 I |User_input_7| I • AS " ' | | User_input_7 | | ' ^k" , • 

If no values are entered for User Input 7 then this 
string is not created. 

In the case of Example 1, the following string is 
30 created: 

EMPNO AS "EMPNO__pk" 

In the case of Example 2, the following string is 
35 created: 

ORGHIER_ID as "ORGHIER__ID_pk" 
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The user is required to identify the start point of 
the hierarchy via User Input 4. If no value is given for 
this then a default value of NULL is assumed. However, if 
a value is given then a string, knovm as the start point 
string, is generated as follows: 

' = ' I I User_input_4 

If no value or NULL is entered then the string is generated 
as : 

'IS NULL" 

Therefore, with respect to Example 1, the following 



tring is created: 



s 



IS NULL 



In Example 2, the start point is defined as the value 
"0" and so the string is created as: 



= 0 

The next string, known as the CONNECT BY string, to be 
created is made up of the values entered for User Input 5 
concatenated with the parent items and the child items 
values entered for User Inputs 2 and 3 respectively as 
shown below: 

User_input_5 | | • 1 1 ' 1 | User_input_3 | | • 

= ' I I User_input_5 1 | ' | 1 ' | I User_input_2 

If no values exist for 'User Input 5' then the 
following string is created instead: 



User_input_3 | | ■ = ' | | User_input_2 
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If more than one value exists for 'User Input 5' then 
the strings created for each item are concatenated. 

With respect to Example 1, no. User Input 5 values 
5 exist and so the following string is created: 

EMPNO = MGR 

In Example 2, two values exist for User Input 5 and so 
10 two strings are created and concatenated: 

VER_ID| |BUS_GRP_NO| | EMPNO = VERBID | | BUS_GRP_NO | | MGR 

An SQL statement is then created from the strings 
15 created above as shown below: 

(SELECT LEVEL AS LVL, <INSERT KEY ITEMS STRING>, <INSERT 
AGGREGATE FUNCTIONS STRING> ,< INSERT PRIMARY KEY STRING > , ' 
I |User_input_2 | | ' AS " ' | | User_input_2 | | ' " , ' 

2 0 1 I User__input_3 | | » AS " • | | User_input_2 | | ' " FROM ' | | <INSERT 

BASIC SQL OBJECT GENERATED ABOVE > | | ' B CONNECT BY PRIOR ' 
<INSERT CONNECT BY STRING > " START WITH ' | | User_input_2 | | ' 
<INSERT START POINT STRING>) 

25 This SQL statement can then be validated against the 

database to check that the SQL syntax construction is 
correct . 

For Example 1, this creates the following SQL 

3 0 statement : 

(SELECT LEVEL AS LVL, TREE_SAL ( EMPNO) AS " SAL_TREESUM" , 
LEAF_SAL (EMPNO) AS " SAL_LEAFSUM" , TREE_COMM (EMPNO) AS 
"COMM_TREESUM" , LEAF_COMM (EMPNO) AS " COMM_LEAFSUM" , EMPNO 
3 5 AS "EMPNO_PK", MORAS "MGR" FROM (SELECT MGR, EMPNO, SAL, 
COMM FROM EMP) B CONNECT BY PRIOR (EMPNO = MGR) START WITH 
MGR IS NULL) 




For Example 2, the following SQL statement 
generated : 



(SELECT LEVEL AS LVL, BUS_GRP_NO AS "BUS_GRP_NO" , VER_ID AS 
"VER_ID", ORGHIER_ID AS "ORGHIER_ID_pk" , MGR AS "MGR" , 
EMPNO AS "EMPNO", FROM (SELECT MGR, EMPNO, VER_ID, 
BUS_GRP_NO, ORG_NAME, ORGHIER_ID FROM ORG_H I ER) B CONNECT BY 
PRIOR ( VER_ID 1 ! BUS_GRP_NO | | EMPNO=VER_ID 1 | BUS_GRP_NO | | MGR) 
START WITH MGR = 0) 

In order to construct the second SQL statement for 
calculating the hierarchy level combined with the aggregate 
values, a string is constructed that prepares the SQL 
generated statement generated above for joining with the 
SQL segment for calculating the number of subordinates 

described below. 

The second statement is comprised of several strings. 
The first of these strings is known as the outer aggregate 
values string. Its composition depends on the number of 
values entered into User Input 6. If no values are entered 
then the string is left empty. If one aggregate item is 
entered then the string is as follows: 

- I luser_input_6 | 1 •_treesum" , " • 1 | User_input_6 1 | '_leaf sum" 

If more than one value has been entered into User 
Input 6 then a string is created for each value and all 
the strings are concatenated but separated by commas. 

In Example 1, there are two values of User Input 6, 
SAL and COMM, so the resultant string is: 

"SAL_treesum" , "SAL_leaf sum" , "COMM_treesum" , "COMM_leaf sum" 

Example 2 has no values for User Input 6 and so no 

string is created. 

The next string to be created is the outer key 
values string. The creation follows the same pattern as 
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for the outer aggregate values string but is based on 
User Input 5. If User Input 5 values exist then the 
string is: . 

5 "'II User_input_5 | | ' " 

If more than one value exists then a string for each 
value is concatenated to form a comma separated string. 

Example 1 has no value entered for User Input 5 and 
10 so no string is created. 

Example 2 has two values for User Input 5 and so the 
following string is created: 

" BUS_GRP_NO " , " VER_I D " 

15 

The next string is known as the outer primary key 
string. It is generated following the same principal as 
the outer key values string but based on User Input 7. 
If User Input 7 values exist then the string is: 

20 

" I I User_input_7 I j '_pk" 

If User Input 7 has no value then no string is 
created. 

25 

For Example 1 the following string is created: 
"EMPNO_pk" 

3 0 For Example 2 the following string is created: 

"ORGHIER_ID_pk" 



35 



The four strings generated above are then combined 
to form the second SQL statement, known as the final 
outer level aggregation SQL statement, as shown below: 



(SELECT LVL, <INSERT OUTER KEY VALUES STRING > , <INSERT 
OUTER AGGREGATE VALUES STRING> , <INSERT OUTER PRIMARY KEY 
STRING> ■■||User_input_2l|" ' , ' " j 1 User_input_3 1 | " ■ 

FROM <INSERT BASIC SQL OBJECT GENERATED ABOVE > 1 | ' B 
CONNECT BY PRIOR ' <INSERT CONNECT BY STRING > - START 
WITH • I |User_input_2l 1 • <INSERT START POINT STRING>) X 

This SQL statement can be run or validated, at this 
point, against the database to check for syntactical 
errors . 

For Example 1, the second SQL statement is generated 

■i 

as follows: 

(SELECT LVL, "SAL_TREESUM-' , "SAL_LEAFSUM" , 
..COMM_TREESUM", " COMM_LEAFSUM" , ■■EMPNO_PK" , "MGR" , 
"EMPNO" FROM (SELECT LEVEL AS LVL, TREE_SAL (EMPNO) AS 
••SAL_TREESUM", LEAF_SAL (EMPNO) AS " SAL_LEAFSUM " , 
TREE_COMM (EMPNO) AS " COMM_TREESUM" , LEAF_COMM (EMPNO) AS 
"COMM_LEAFSUM", EMPNO AS "EMPNO_PK" , MGR, EMPNO FROM 
(SELECT MGR, EMPNO, SAL, COMM FROM EMP) B CONNECT BY 
PRIOR EMPNO = MGR START WITH MGR IS NULL) X 

Example 2 requires no aggregation and so the 
following statement is generated:- 

(SELECT LVL, " BUS_GRP_NO " , "VER_ID" , •'ORGHIER_ID_PK" , 
"MGR", "EMPNO" FROM (SELECT LEVEL AS LVL, BUS_GRP_NO AS 
"BUS_GRP_NO", VER_ID AS "VER_ID" , ORGHIER_ID AS 
..ORGHIER_ID_pk", MORAS "MGR", EMPNO AS "EMPNO", FROM 
(SELECT MGR, EMPNO, VER_ID, BUS_GRP_NO, ORG_NAME, 
ORGHIER_ID FROM ORG_HIER) B CONNECT BY PRIOR 

( VER_ID I I BUS_GRP_NO | | EMPNO=VER_ID i | BUS_GRP_NO | | MGR) START 
WITH MGR = 0 ) X 

It is now necessary to construct the SQL segment for 
calculating the number of subordinates for each record in 
the hierarchy. 
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In order to do this, a string known as the 
subordinates connect by string is first created from .the 
values of User Input 5 concatenated with the values of 
the parent items and the child items from User Inputs 2 
5 and 3 respectively- The string is created as shown 
below : 

User_input_5 | | ' | | ' | | User_input_2 | | * = 
' I I User__input_5 | | ' | I ' I 1 User_input_3 

10 

If no values exist for 'User Input 5' then the 
following string is created instead: 

User_input_2 I I • = • | |User_input_3 

15. 

If more than one value exists for 'User Input 5' 
then the strings created for each item are concatenated. 

With respect to Example 1, no User Input 5 values 
exist and so. the following string is created: 

20 

(MGR = EMPNO) 

In Example 2, two values exist for User Input 5 and 
so two strings are created and concatenated: 

25 

(VER_ID| |BUS_GRP_N0| |MGR=VER_ID| I BUS__GRP_N0 I I EMPNO) 

An SQL statement, known as the subordinates select 
SQL statement, is then created from this string to find 
3 0 the number of aggregates at each of the node points in 
the hierarchy: 

' (SELECT SUBORDINATES, ' | | User_input_5 | ' , ' 
I |User_input__2 | | ' , ' | | User_input_3 | | ' , ' | | User_input_7 | | 
35 FROM ( SELECT COUNT (*) - 1 AS SUBORDINATES,.' 
I |User_input_5 | | ' , ' | | User_input_2 | | ^ ' 

I |User_input_3 | | ' / ' | | User_input_7 | | » FROM ( ' | | <INSERT 
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BASIC SQL OBJECT GENERATED ABOVE> 1 | ' ) B CONNECT BY 
PRIOR' <INSERT SUBORDINATES CONNECT BY STRING GROUP BY 
' I |User_input_5 | * , ' | j User_input_2 | | * / ' 
I I User_input_3 | | ' , ' | | User_input_7 1 | ' ) ) Y » 

5 

If User Input 5 has no values then it is omitted 
from the SQL statement shown above. If User Input 5 
contains more than one value, each value is included but 
they are separated by commas . 
10 If User Input 7 has no values or if it is identical 

to User Input 3 then it is omitted from the SQL statement 
shown above. 

This SQL statement can be run or validated, at this 
point, against the database to check for syntactical 
15 errors. 

In the case of Example 1, the SQL segment for 
• calculating the number of subordinates is: 

(SELECT SUBORDINATES, MGR, EMPNO, FROM (SELECT 
2 0 COUNT (*)-l AS SUBORDINATES, MGR,EMPNO FROM (SELECT MGR, 
EMPNO, SAL, COMM FROM EMP) B CONNECT BY PRIOR (MGR=EMPNO) 
GROUP BY MGR, EMPNO) ) Y 

In the case of Example 2, the corresponding SQL 

2 5 segment is: 

(SELECT SUBORDINATES, BUS_GRP_NO, VER_ID, MGR, EMPNO, 
ORGHIER_ID FROM (SELECT COUNT (*)-l AS SUBORDINATES, 
BUS_GRP_NO, VER_ID,MGR, EMPNO, ORG_HIER_ID FROM (SELECT 

3 0 MGR, EMPNO, VER__ID, BUS_GRP__NO, ORG_NAME, ORGHIER_ID 

FROM ORG_HIER)B CONNECT BY PRIOR 

(VERBID I |BUS_GRP_NO| | MGR= VERBID | | BUS_GRP_NO | j EMPNO) GROUP 
BY BUS_GRP_NO, VER_ID, MGR, EMPNO, ORGHIER_ID) ) Y 

35 As previously described, the SQL segments generated 

for calculating the tree and leaf sum aggregates, the 
level and number of subordinates are combined into a 



final SQL statement. In order to do this, two strings 
are first created. 

The first string, known as the key select string is 
created from the values of User Input 7 provided. It is 
created as: 

• " I |User_input_7 | | »_pk" ' 

However, if there is no value for User Input 7 then 
the key select string is: 

'"II User_input_3 | | ' " , " ' || User_input_5 || ' " / ' | I 

If User Input 5 contains more than one value then 
each value is concatenated in a comma separated format. 

If there are no values for both User Input 7 and 
User Input 5 then the key select string is: 

» " I I User_input_3 | | ' " 

In the case of Example 1, the key select string is: 
" EMPNO_pk" 

In the case of Example 2, it is: 
" ORGHIER_ID_pk" 

In both examples values of User Input 7 exist, but 
to illustrate the point, if they did hot, then the 
strings would be instead: 

Example 1: "EMPNO" 

Example 2: "EMPNO", "BUS_GRP_NO" , "VERBID" 



The next string is known as the where clause string. 
It is constructed from the input values provided as shown 
below: 

WHERE X. " • I |User_input_2 | | ' " = y. ' | | User_input_2 | | ' ( + ) 
AND X. " • I |User_input_3 | | ' " = y- " | | User_input__3 | | M + ) " AND 
X. " • I I User_input_5 | | ' " = y. M | User_input_5 | p ( + ) * 

If no values exist for User Input 5 then the string 
is modified such that it does not refer to this as is 
shown below with respect to Example 1. If User Input 5 
has more than one value then each value is repeated and 
concatenated to the end of the string as is shown below 
with respect to Example 1: 

In the case of Example 1, where no User Input 5 
values exist, the where clause string is: 

'WHERE x."MGR" = y.MGR(+) AND x."EMPNO" = y.EMPNO(+)' 

In the case of Example 2, where two User Input 5 
values are provided, the where clause string is: 

'WHERE x-"MGR" = y.MGR( + ) AND x. "EMPNO" = y.EMPNO( + ) AND 
X. "VERBID" = y. VERBID ( + ) AND x. "BUS_GRP_NO" = 
y , BUS_GRP__NO ( + ) ' 

All the dynamically created SQL segments together 
with the key select and where clause strings are combined 
into a single SQL statement as shown: 

'SELECT LVL, SUBORDINATES, <INSERT OUTER AGGREGATE VALUES 
STRING>, <INSERT KEY SELECT STRING> FROM <INSERT FINAL 
OUTER LEVEL AGGREGATION SQL STATEMENT>, <INSERT 
SUBORDINATES SELECT SQL STATEMENT > < INSERT WHERE CLAUSE 
STRING> 
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This dynamic SQL statement once created can be run. 
or validated on the database to check for syntactical 
errors . 

5 In the case of Example 1, the following final SQL 

statement is generated: 

SELECT LVL, SUBORDINATES, " SAL_TREESUM" , " SAL_LEAFSUM " , 
"COMM_TREESUM" , "COMM_LEAFSUM" , "EMPNO_PK" FROM (SELECT 

10 LVL, "SAL_TREESUM" , " SAL_LEAFSUM " , " COMM_TREESUM" , 

"COMM_LEAFSUM" , "EMPNO_PK" , "MGR" , "EMPNO" FROM (SELECT 
LEVEL AS LVL, TREE_SAL (EMPNO) AS " SAL_TREESUM" , 
LEAF_SAL (EMPNO) AS " S7yL_LEAFSUM " , TREE_COMM (EMPNO) AS 
"COMM_TREESUM" , LEAF_COMM (EMPNO) AS "COMM_LEAFSUM" , EMPNO 

15 AS "EMPNO_PK", MGR, EMPNO FROM (SELECT MGR, EMPNO, SAL, 
COMM FROM EMP) B CONNECT BY PRIOR EMPNO = MGR START WITH 
MGR IS NULL) X (SELECT SUBORDINATES, MGR, EMPNO, FROM 
(SELECT COUNT (*)-l AS SUBORDINATES, MGR, EMPNO FROM 
(SELECT MGR, EMPNO, SAL, COMM FROM EMP) B CONNECT BY 

20 PRIOR (MGR=EMPNO) GROUP BY MGR, EMPNO) ) Y> 'WHERE X."MGR" 
= Y.MGR(+) AND X. "EMPNO" = Y.EMPNO(+)' 

For Example 2, the final SQL statement is: 

2 5 SELECT LVL, SUBORDINATES, "ORGHIER_ID_PK" FROM (SELECT 

LVL, "BUS_GRP_NO" , "VER_ID" , "ORGHIER_ID_PK" , "MGR", 
"EMPNO" from (SELECT LEVEL AS LVL, BUS_GRP_NO AS 
"BUS_GRP_NO" , VER_ID AS "VER_ID" , ORGHIER_ID AS 
"ORGHIER_ID_P.K" , MGR AS "MGR", EMPNO AS "EMPNO", FROM 

3 0 (SELECT MGR, EMPNO, VER_ID, BUS_GRP_NO, ORG_NAME FROM 

ORG_HIER)B CONNECT BY PRIOR 

(VER_ID| |BUS_GRP_NO| | EMPNO =VER_ID | | BUS_GRP_NO | | MGR) START 
WITH MGR = 0 ) X (SELECT SUBORDINATES, BUS_GRP_NO, 
VER_ID, MGR, EMPNO, ORGHIER_ID FROM (SELECT COUNT (*)-l 
35 AS SUBORDINATES, BUS_GRP_NO, VER_ID,MGR, EMPNO, 

ORGHIER_ID FROM (SELECT MGR, EMPNO, VER_ID, BUS_GRP_NO, 
ORG NAME, ORGHIER_ID FROM ORG_HIER) B CONNECT BY. PRIOR 



28 

(VER_ID| I BUS__GRP___NO I | MGR=VER__ID | | BUS_GRP_NO | |EMPN0) GROUP 
BY BUS_GRP_NO, VERBID, MGR, EMPNO, ORGHIER_ID) ) Y WHERE 
X."MGR" = Y.MGR( + ) AND X."EMPNO" = Y.EMPNO( + ) AND 
X."VER__ID" = Y,VER_ID( + ) AND X . " BUS_GRP_NO " = 
5 Y . BUS_GRP_NO ( + ) 

The final SQL statement, once it has been created, 
can be hidden inside a database object such as a view, 
using the following SQL statement: 

10 

'CREATE VIEW | | User_input_l | | '_VH AS (<INSERT FINAL SQL 
STATEMENT STRING >) ; • 

This view can then be dynamically generated and 
15 treated like a table and be used by analysis tools (such 
as Discoverer) to extract information on hierarchy 
levels, number of subordinates, hierarchical node 
aggregation and node identification (leaf or tree) . This 
can be done without the analysis tool having to generate 
2 0 any special SQL. syntax, for example using CONNECT BY or 
START WITH clauses 

For Example 1, a seven-column view is created as 
defined below: 

2 5 CREATE VIEW EMP_VH AS (SELECT LVL, SUBORDINATES, 

" SAL__TREESUM " , " SAL_LEAFSUM " , " COMM__TREESUM " , 
"COMM_LEAFSUM" , "EMPNO_PK" FROM (SELECT LVL, 
" SAL_TREESUM " , " SAL_LEAFSUM " , " COMM^TREESUM " , 
*»COMM__LEAFSUM" , "EMPNO_PK" , "MGR" , "EMPNO" FROM (SELECT 

3 0 LEVEL AS LVL, TREE_SAL (EMPNO) AS " SAL_TREESUM" , 

LEAF_SAL ( EMPNO) AS " SAL_LEAFSUM " , TREE_COMM ( EMPNO ) AS 
"COMM_TREESUM" , LEAF_COMM (EMPNO) AS " COMM_LEAFSUM" , EMPNO 
AS "EMPNO_PK", MGR, EMPNO FROM (SELECT MGR, EMPNO, SAL, 
COMM FROM EMP) B CONNECT BY PRIOR EMPNO = MGR START WITH 
35 MGR IS NULL) X (SELECT SUBORDINATES, MGR, EMPNO, FROM 
(SELECT COUNT (*)-l AS SUBORDINATES, MGR, EMPNO FROM 
(SELECT MGR, EMPNO, SAL, COMM FROM EMP) B CONNECT BY 
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PRIOR (MGR=EMPNO) GROUP BY MGR, EMPNO) ) Y> 'WHERE X."MGR" 
= Y.MGR( + ) AND X. "EMPNO" = Y. EMPNO ( + ) ') 

The seven column view will contain the following 
5 columns : 



Column 
Number 


Coliimn Name 


Description 


1 


EMPNO_PK 


Primary Key 


2 


LVL 


A pseudo column that interrogates 
the hierarchy to find, for each 
row of data, its level within the 
hierarchy 


3 


SUBORDINATES 


The number of subordinate rows of 
data for this record within the 
hierarchy 


4 


SAL_TREESUM 


Tree sum aggregation for SAL 


5 


SAL_LEAFSUM 


Leaf sum aggregation for SAL 


6 


COMM_TREESUM 


Tree sum aggregation for COMM 


7 


COMM_LEAFSUM 


Leaf sum aggregation for COMM 



If desired, this seven column view can then be 
joined back the EMP table shown in Figure 1 using a one 
to one join between the EMPNO column of EMP and EMPNO_PK 
20 column of the view. A user is then able to select an 

employee and extract, via the join, the employees level, 
number of subordinates, SAL tree and leaf sum and COMM 
leaf and tree sum without the user having to enter any 
complex SQL. 

25 In the case of Example 2 a three column view is 

created: 

CREATE VIEW ORG_HIER_VH AS (SELECT LVL, SUBORDINATES, 
"ORGHIER_ID_PK" FROM (SELECT LVL, "BUS_GRP_NO" , 
30 ^ "VER_ID", "ORGHIER_ID_PK" , "MGR" , "EMPNO" FROM (SELECT 
LEVEL AS LVL, BUS GRP NO AS "BUS GRP_NO" , VERBID AS 
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"VER_ID", ORGHIER_ID AS "ORGHIER_ID_PK" , MGR AS "MGR" , 
EMPNO AS "EMPNO", FROM (SELECT MGR, EMPNO, VER_ID, 
BUS_GRP_NO, ORG_NAME, ORGHIER_ID FROM ORG_HIER) B CONNECT 
BY PRIOR 

5 (VERBID I I BUS__GRP___NO | | EMPNO=VER_ID | | BUS_GRP__NO | | MGR) START 
WITH MGR = 0 ) X (SELECT SUBORDINATES, BUS_GRP_NO, 
VERBID, MGR, EMPNO, ORGHIER_ID FROM (SELECT COUNT (*)-! 
AS SUBORDINATES, BUS_GRP_NO, VER_ID,MGR, EMPNO, 
ORGHIER__ID FROM (SELECT MGR, EMPNO, VERBID, BUS_GRP_NO, 

10 ORG^NAME, ORGHIER_ID FROM ORG_HIER) B CONNECT BY PRIOR 

(VERBID I I BUS_GRP_NO | | MGR=VER_ID | | BUS_GRP_NO [ | EMPNO) GROUP 
BY BUS_GRP_NO, VER_ID, MGR, EMPNO, ORGHIER_ID) ) Y WHERE 
X."MGR" = Y.MGR(+) AND X. "EMPNO" = Y.EMPNO(+) AND 
X."VER_ID" = Y.VER_ID( + ) AND X . " BUS_GRP_NO " = 

15 Y.BUS GRP NO( + ) ) 



The three column view will contain the following 
columns : 



ColiJinn 
Nioinber 


ColiJinn Name 


Description 


1 


ORGHIER_ID__PK 


Primary Key 


2 


LVL 


Pseudo column that interrogates 
the hierarchy to find, for each 
row of data, its level within 
the hierarchy 


3 


SUBORDINATES 


The number of subordinate rows 
of data for this record within 
the hierarchy 



This view can then be joined to. the ORG_HIER table 
shown in Figure 6 using a one to one join between the 
ORGHIER_ID column of ORG_HIER table and the ORGHIER_ID__PK 
column of the view. A user can then extract level and 
30 number of subordinates data for each record in ORG_HIER. 



CLAIMS 



1. A computer implemented method for extracting 
hierarchical data from one or more tables in a database, 
the method comprising: 

a- receiving user input identifying, at least, one 
or more tables in the database that contain the 
hierarchical data and the hierarchical 
structure of the hierarchical data; 

b. generating a Structured Query Language (SQL) 
statement using the user input received in step 
(a) ; 

c. presenting the SQL statement to the database 
for execution; and, 

d. extracting the hierarchical data from the 
database in response to the execution of the 
SQL statement. 

2. A method according to claim 1, wherein the user 
input identifying the hierarchical structure indicates at 
least a column of one of the database tables that 
represents the parent items of the hierarchical structure 
and a column of one of the database tables that 
represents the child items of the hierarchical structure. 

3. A method according to claim 2, wherein the data 
value that is the top of the hierarchical structure has a 
default value of NULL. 

4. A method according to claim 2, wherein the user 
input identifying the hierarchical structure indicates 
the data, value in. the .column representing the parent 
items that is the top of the hierarchical structure. 

5. A method according to any of claims 2 to 4, wherein 
the user input identifying the hierarchical structure 
indicates at least one further column of one of the 
tables that further restricts the hierarchy. 

6. A method according to any of the preceding claims, 
wherein, prior to generating the SQL statement, further 
user input is received, the user input indicating at 
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least one column of one of the database tables on which 
leaf sum aggregation should be performed. 

7. A method according to any of the preceding claims, 
wherein, prior to generating the SQL statement, further 

5 user input is received, the user input indicating at 

least one column of one of the database tables on which 
tree sum aggregation should be performed, 

8. A method according to any of the preceding claims, 
wherein the SQL statement is generated from the user 

10 input by: 

i. constructing object SQL that will return 
all the database items identified by the 
user input ; 

ii. constructing object SQL that calculates 
15 the level of an item in the hierarchical 

structure ; 

iii. constructing object SQL that calculates 
the number of child items appendant to an 
item in the hierarchical structure; and, 

2 0 iv. combining the object SQL of steps (i) to 

(iii) in a single object SQL statement. 

9 . A method according to claim 8 when dependent on 
claim 6, wherein further object SQL that performs the 
leaf sum aggregation is constructed and this further 

25 object SQL is combined with the object SQL of steps (i) 
to (iii) in step (iv) . 

10. A method according to claim 8 or claim 9 when 
dependent on claim 7, wherein further object SQL that 
performs the tree sum aggregation is constructed and this 

3 0 further object SQL is combined with the object SQL of 

steps (i) to (iii) in step (iv) . 

11. A computer program comprising computer program code 
means adapted to perform the steps of any of the 
preceding claims when said program is run on a computer. 

35 12. A computer program product comprising program code 

means stored on a computer readable medium for performing 
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the method of any of claims 1 to 10 when said program 
product is run on a computer. 

13 . A method substantially as hereinbefore described 
with reference to the accompanying drawings. 
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ABSTRACT 

HIERARCHICAL DATA EXTRACTION 

A computer implemented method is provided for 
extracting hierarchical data from one or more tables in a 
database, the method comprises firstly receiving user 
input identifying, one or more tables in the database 
that contain the hierarchical data and the hierarchical 
structure of the hierarchical data. A Structured Query 
Language (SQL) statement is then generated using the user 
input received. The SQL statement is presented to the 
database for execution and the hierarchical data are then 
extracted from the database in response to the execution 
of the SQL statement . 

(Figure 6) 
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